Which distributional cues help the most? Unsupervised contexts selection for lexical category acquisition
نویسندگان
چکیده
Starting from the distributional bootstrapping hypothesis, we propose an unsupervised model that selects the most useful distributional information according to its salience in the input, incorporating psycholinguistic evidence. With a supervised Parts-of-Speech tagging experiment, we provide preliminary results suggesting that the distributional contexts extracted by our model yield similar performances as compared to current approaches from the literature, with a gain in psychological plausibility. We also introduce a more principled way to evaluate the effectiveness of distributional contexts in helping learners to group words in syntactic categories.
منابع مشابه
The Role of Distributional Information in Linguistic Category Formation
A crucial component of language acquisition involves organizing words into grammatical categories and discovering relations between them. Many studies have argued that phonological or semantic cues or multiple correlated cues are required for learning. Here we examine how distributional variables will shift learners from forming a category of lexical items to maintaining lexical specificity. In...
متن کاملFrom shared contexts to syntactic categories: the role of distributional information in learning linguistic form-classes.
A fundamental component of language acquisition involves organizing words into grammatical categories. Previous literature has suggested a number of ways in which this categorization task might be accomplished. Here we ask whether the patterning of the words in a corpus of linguistic input (distributional information) is sufficient, along with a small set of learning biases, to extract these un...
متن کاملDistributional learning and lexical category acquisition: What makes words easy to categorize?
In this study, results of computational simulations on English child-directed speech are presented to uncover what distributional properties of words make it easier to group them into lexical categories. This analysis provides evidence that words are easier to categorize when (i) they are hard to predict given the contexts they occur in; (ii) they occur in few different contexts; and (iii) thei...
متن کاملInequality between the classes: Phonological and distributional typicality as predictors of lexical processing
Information about the syntactic category of a word can be derived from a number of complementary sources. We focus here on phonological and distributional cues for distinguishing nouns and verbs that have been proposed as useful for language acquisition. In this paper we assessed the extent to which this information affects lexical processing in adults. We hypothesised that the phonological or ...
متن کاملLexical Category Acquisition as an Incremental Process
Psycholinguistic studies suggest that early on children acquire robust knowledge of the abstract lexical categories such as nouns, verbs and determiners (e.g., Gelman & Taylor, 1984; Kemp et al., 2005). Children’s grouping of words into categories might be based on various cues, including the phonological and morphological properties of a word, the distributional information about its surroundi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015